Collaborator

@joshuayao joshuayao commented Apr 2, 2025

Add v1.3 release notes.

@joshuayao joshuayao requested review from hshen14 and kding1 April 2, 2025 08:54
@joshuayao joshuayao added this to OPEA Apr 9, 2025
@joshuayao joshuayao added this to the v1.3 milestone Apr 9, 2025
@joshuayao joshuayao added the WIP and documentation labels Apr 9, 2025
@joshuayao joshuayao moved this to In progress in OPEA Apr 9, 2025
@joshuayao joshuayao marked this pull request as ready for review April 9, 2025 08:23
@joshuayao joshuayao removed the WIP label Apr 9, 2025
@joshuayao joshuayao moved this from In progress to In review in OPEA Apr 9, 2025
@joshuayao joshuayao added the v1.3 label Apr 10, 2025
@joshuayao joshuayao requested a review from Copilot April 11, 2025 07:26
Contributor

Copilot AI left a comment

Copilot reviewed 2 out of 2 changed files in this pull request and generated 1 comment.

Comments suppressed due to low confidence (2)

release_notes/v1.3.md:2

  • The TODO placeholder for the number of pull requests may confuse users. Consider updating this placeholder with the final count before release.
We are excited to announce the release of OPEA version 1.3, which includes significant contributions from the open-source community. This release addresses over 400(TODO: the number is subject to change) pull requests.

release_notes/RELEASE.md:18

  • [nitpick] The date format seems inconsistent with other entries (e.g., using 'Apr' instead of 'July'). Consider standardizing abbreviations to maintain consistency in the release schedule.
| 1.4     | July 2025    |

Collaborator

@mkbhanda mkbhanda left a comment

Does AMD want to add to the table of supported models on serving frameworks and hardware?
Arc is not mentioned in the table; do we want to add it?

Contributor

@Yu-amd Yu-amd left a comment

AMD team has a validated model matrix we can provide.

Contributor

Yu-amd commented Apr 22, 2025

AMD team has a validated model matrix we can provide.

| Model                                     | TGI-Gaudi | vLLM-CPU | vLLM-Gaudi | vLLM-ROCm | OVMS | Optimum-Habana | PredictionGuard |
| ----------------------------------------- | --------- | -------- | ---------- | --------- | ---- | -------------- | --------------- |
| deepseek-ai/DeepSeek-R1-Distill-Llama-8B  | ✓         | -        | ✓          | ✓         | -    | ✓              | -               |
| deepseek-ai/DeepSeek-R1-Distill-Llama-70B | ✓         | -        | ✓          | ✓         | -    | ✓              | -               |
| deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B | ✓         | -        | ✓          | ✓         | -    | ✓              | -               |
| deepseek-ai/DeepSeek-R1-Distill-Qwen-7B   | ✓         | -        | ✓          | ✓         | -    | ✓              | -               |
| deepseek-ai/DeepSeek-R1-Distill-Qwen-14B  | ✓         | -        | ✓          | ✓         | -    | ✓              | -               |
| deepseek-ai/DeepSeek-R1-Distill-Qwen-32B  | ✓         | -        | ✓          | ✓         | -    | ✓              | -               |
| deepseek-ai/Deepseek-v3                   | ✓         | -        | ✓          | ✓         | -    | ✓              | -               |
| Hermes-3-Llama-3.1-8B                     | -         | -        | -          | ✓         | -    | -              | ✓               |
| ibm-granite/granite-3.2-8b-instruct       | -         | -        | ✓          | ✓         | -    |                | -               |
| Phi-4-mini                                | x         | x        | x          | ✓         | x    | ✓              | -               |
| Phi-4-multimodal-instruct                 | x         | x        | x          | ✓         | x    | ✓              | -               |
| mistralai/Mistral-Small-24B-Instruct-2501 | ✓         | -        | ✓          | ✓         | -    | ✓              | -               |
| mistralai/Mistral-Large-Instruct-2411     | x         | -        | ✓          | ✓         | -    | ✓              | -               |

Phi-4-mini works with latest docker image: rocm/vllm:rocm6.3.1_instinct_vllm0.8.3_20250410

(Attached image: ROCm_Model_matrix)

@joshuayao Please see the table above, which includes a column of validated models on ROCm.

Thanks!

Collaborator

@ashahba ashahba left a comment

LGTM!

Collaborator Author

AMD team has a validated model matrix we can provide.

Hi @Yu-amd, updated. Thanks for the information.

Contributor

@Yu-amd Yu-amd left a comment

LGTM. Thank you.

Contributor

@eero-t eero-t left a comment

Sorry, I had typos in my suggestions.

Co-authored-by: Eero Tamminen <eero.t.tamminen@intel.com>
Collaborator Author

Sorry, I had typos in my suggestions.

No problem. Thanks for the updates.

@joshuayao joshuayao requested a review from eero-t April 25, 2025 04:44
Contributor

@eero-t eero-t left a comment

I guess the "Other Notable Changes" section content came directly from the PR titles, but for the release notes I think it would be good to use correct and consistent capitalization for the names (and maybe also monospace for function names).

The following terms seemed to have instances with incorrect capitalization:

  • AgentQnA
  • Dataprep
  • DeepSeek
  • Docker
  • Helm
  • LlamaGuard
  • LVM
  • Milvus
  • MySQL
  • OpenAI
  • PubMed
  • Python
  • RAG
  • TEI
  • vLLM
  • Xeon
  • YAML
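
A check like this could also be scripted. The following is only a minimal sketch, assuming the release notes are available as plain text and using just the terms listed above; the function name `find_miscapitalized` is hypothetical and not part of any OPEA tooling.

```python
import re

# Canonical spellings, copied from the list in the review comment above.
CANONICAL_TERMS = [
    "AgentQnA", "Dataprep", "DeepSeek", "Docker", "Helm", "LlamaGuard",
    "LVM", "Milvus", "MySQL", "OpenAI", "PubMed", "Python", "RAG",
    "TEI", "vLLM", "Xeon", "YAML",
]


def find_miscapitalized(text, terms=CANONICAL_TERMS):
    """Return (line_number, found, expected) for each wrongly capitalized term.

    Hypothetical helper: matches each term case-insensitively as a whole
    word and reports occurrences whose spelling differs from the canonical one.
    """
    findings = []
    for lineno, line in enumerate(text.splitlines(), start=1):
        for term in terms:
            # Lookarounds keep "rag" inside "drag" from matching "RAG".
            pattern = r"(?<![A-Za-z0-9])" + re.escape(term) + r"(?![A-Za-z0-9])"
            for match in re.finditer(pattern, line, flags=re.IGNORECASE):
                if match.group(0) != term:
                    findings.append((lineno, match.group(0), term))
    return findings


if __name__ == "__main__":
    sample = "Fix vllm helm chart\nAdd DeepSeek support"
    for lineno, found, expected in find_miscapitalized(sample):
        print(f"line {lineno}: '{found}' should be '{expected}'")
```

Lowercase names inside commands or paths (for example a literal `helm install` invocation) would be reported too, so the output needs a manual pass rather than an automatic replace.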

Co-authored-by: Eero Tamminen <eero.t.tamminen@intel.com>
@ftian1 ftian1 merged commit 3f931d7 into opea-project:main Apr 25, 2025
4 checks passed
@github-project-automation github-project-automation bot moved this from In review to Done in OPEA Apr 25, 2025
